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ABSTRACT 

In this paper, we propose a novel age estimation method based 
on gradient location and orientation histogram (GLOH) de- 
scriptor and multi-task learning (MTL). The GLOH, one of 
the state-of-the-art local descriptor, is used to capture the age- 
related local and spatial information of face image. As the ex- 
tracted GLOH features are often redundant, MTL is designed 
to select the most informative GLOH bins for age estimation 
problem, while the corresponding weights are determined by 
ridge regression. This approach largely reduces the dimen- 
sions of feature, which can not only improve performance but 
also decrease the computational burden. Experiments on the 
public available FG-NET database show that the proposed 
method can achieve comparable performance over previous 
approaches while using much fewer features. 

Index Terms — Age estimation, GLOH feature, multi- 
task learning, ridge regression 

1. INTRODUCTION 

Within the past decade, automatic age estimation has become 
an active research topic due to its emerging new applications 
from human-computer interaction to security control, surveil- 
lance monitoring, biometrics, etc. For example, in automatic 
human computer interaction (HCI) applications if computers 
can determine the age of the user, both the content of com- 
puter and the type of interaction can be adjusted according 
to the age of the user. In security control and surveillance 
monitoring, the automatic age estimation system can prevent 
minors from drinking wine or purchasing tobacco. 

Aging is a very complicated process and is determined by 
both innate factors and environmental factors such as hered- 
ity, gender, health, and lifestyle, which make the automatic 
age estimation very challenging. Much works on age esti- 
mation problem has been undertaken in recent years. Two 
keys to these methods are face representation and age esti- 
mation d. Existing face representation techniques for age 
estimation often include the anthropometric models 0, ac- 
tive appearance models EHHO, aging pattern subspace (6j, 
age manifold QUID, local features such as local binary pat- 
terns (LBP) features [9 ], Gabor features fTOlL spatially fexible 



patch (SFP) (TT1, bio-inspired features (BIF) [Q21, etc. and the 
combination of them (l3j[T4l. Based on these face represen- 
tation, the age estimation can be performed by considering it 
as a classification problem [31 [4l [6) |9] 031 or a regression 
problem ElEllElElSlElora hybrid of two Q. It is well 
known that aging process shares a global trend but is specific 
to a given individual. Most of existing methods concern the 
building of global age estimator ||T0j [121 ITT) due to the lack of 
training data for each individual. There are also a few works 
care on the person specific age estimation OS] [6] [5]]. 

In this paper we propose a novel method based GLOH 
representation [ 15 ] and MTL feature selection [16 ] along with 
ridge regression for global age estimation. The basic idea is to 
use the state-of-the-art GLOH descriptor to represent the age- 
related local and spatial information in the face image and 
utilize a sparsity-enforced MTL to select the most informa- 
tive GLOH bins. The selected GLOH bins can be seen as a 
discriminant and compact face representation and are fed into 
ridge regressors to estimate the age. Fig. [T] illustrates the 
framework of our method. 

To the best of our knowledge, the GLOH descriptor and 
regularization based feature selection method are applied to 
age estimation research for the first time. We propose to use 
individual bins, instead of the whole histogram, of GLOH as 
feature for selection and estimation. We use the ridge regres- 
sion on the selected feature for estimation instead of using the 
original sparsity-enforced linear regression, which can avoid 
the underestimation problem of the coefficients induced by 
the sparsity-enforced linear regression. 

The rest of the paper is organized into several sections. In 
section [2l we describe the GLOH-based face representation. 
Sectionj3]details the sparsity-enforced multi-task feature (bin) 
selection and the ridge regression-based estimation. In sec- 
tion [4] we show the experimental results and analysis. Finally 
the section [5] concludes the paper. 

2. GLOH REPRESENTATION 

The GLOH local descriptor, originally introduced by Miko- 
lajczyk et al. fT5lL is designed to increase the robustness and 
distinctiveness of the well-known SIFT descriptor which inte- 
grates both local appearance and position information. Simi- 
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Fig. 1. The framework of the proposed method 



lar to SIFT descriptor or HOG descriptor, it is also based on 
evaluating well-normalized local histograms of image gradi- 
ent orientations in a dense grid. More specifically, the original 
GLOH descriptor can be obtained by computing the SIFT or 
HOG descriptor for a log-polar location grid with three radius 
and eight angles. The gradient orientations are then quantized 
into 16 parts and thus the resulting descriptor gives a 272-bins 
histogram. The size are reduced into 128 by PC A. 

In our implementation, the parameters of GLOH descrip- 
tors are tuned to make it more suitable for our age estimation 
application. More details will illustrate in section]?] In order 
to obtain the GLOH representation of the face image, we first 
divide each face image into patches with overlaps, and then 
compute the GLOH histogram for all patches independently. 
Finally, we concatenate all these GLOH histograms to a high 
dimensional GLOH histogram vector, which is the represen- 
tation for the image containing both the local texture feature 
and spatial information. Note that we don't perform PC A to 
reduce the GLOH dimensionality. 

As we extract the GLOH feature from patches with over- 
laps, they are redundant. However, only a relatively small 
fraction of them is relevant to the estimation task. So feature 
selection is a crucial and necessary step to select the most 
discriminant ones, which can not only improve the estima- 
tion performance but also decrease the computational burden. 
In the next section we will describe how to adopt the sparse- 
enforced regularized-based method for feature (bins) selec- 
tion. 

3. MULTI-TASK GLOH FEATURE SELECTION 

Assuming there is L tasks and the training set consists of sam- 
ples {(xj, yl) G X x y,i = 1 • • • Ni, I = 1 • • • L} where 
I indexes the tasks and i indexes the samples of each task, 
x G y\ K and y G ^R 1 are the GLOH feature and age label, 
respectively, and Ni is the sample size of task I. If we treat 
the training of each task independently, the feature selection 
can be formulated as a sparsity regularized regression on their 



age labels in terms of the GLOH bins 

1 Nl 

^ Wi Y^jWMM)+m\^\\i- c 1 ) 

Due to the small sample size of each task, such a indepen- 
dent feature selection often leads to overfitting, which can be 
combated by the following multi-task generalization fT6l 

1 Ni K 
min-^ JV, x{, yl) + A llw* || 2 , (2) 

1 i=l k=l 

where W = (w\) is the matrix with G Dl K in rows and 
Wfc in columns. 

In our implementation, we treat the age estimation of each 
gender as a task, since there is a significant difference in the 
timing and types of facial growth between men and women. 
In addition, we restrict ourselves to the case of a age regres- 
sion model where the age is linear in the GLOH bins and then 
the loss function is given by 

JV,x^) = ||^-<x^w ( >||2. (3) 

We argue that linear methods are more preferred than non- 
linear ones due to the much faster training and testing speed 
and significantly less memory requirements, especially in the 
cases involving tens of thousands of samples with dimension- 
ality of tens of thousands. 

Notice that the optimization problem ^ is a non-smooth 
problem and in (T6j, the block-coordinate descent method is 
proposed to solve it directly. However, the block-coordinate 
descent is an iterative procedure which may converge slowly. 
In our implementation, we adopt the accelerated algorithm in 
fTTl which reformulates it as two equivalent smooth convex 
optimization problems which are then solved via an optimal 
first-order black-box method for smooth convex optimization. 

Recalled that the above feature selection frame yields both 
the selected feature bin indices and the corresponding coeffi- 
cients and thus can be used for estimation directly. However, 



one can also consider its usage as a pure feature selection tool 
and adopt some other common classifiers or regression meth- 
ods for estimation. Experientially, the above feature selection 
frame often underestimates the coefficients and thus often can 
not achieve satisfied performance. We adopt the ridge regres- 
sion method on the selected feature bins to alleviate this prob- 
lem. 

4. EXPERIMENTAL RESULTS 

We carry experiments on the FG-NET aging database lfT8l to 
verify the proposed age estimation method. The database in- 
cludes 1, 002 images (82 persons) age ranging from to 69. 
First, we align all images into the mean shape, the aligned 
face images are scale to the size of 68 x 62. During the 
GLOH feature extraction step, the size of image patch is set 
as 10 x 10. For each image patch, we use 3 radius {2,3,5} 
and the 8 gradient directions used in each image patch, so the 
dimension of the resulting histogram vector is 136. By con- 
catenating all patches histogram vector, we obtain a 48,960- 
dimensional original GLOH feature vector. It contains both 
the local texture and space information of the face image. The 
sparsity-enforced feature selection are applied to these high- 
dimensional GLOH features and no more than 50 bins are 
often selected in our experiments for the age estimation. 

First we following the leave one person out (LOPO) rules 
in (51 [13. For each fold, all the images of one person are set 
aside as the test set and those of the others are used as the 
training set to simulate the situation in real applications. The 
mean absolute error (MAE) is adopted as the performance 
measures. We also implement the single task learning (STL)- 
based methods. The result as showed in Table Q] Note that 
our method perform better than other methods except BIF- 
based method |12] and the MTL-based method is superior 
than the STL-based method. In order to perform a fair com- 
parison with the BIF-based, we re-implement the method in 
fl2l . where BIF is extracted by the code and the regres- 
sion through the code of LIBSVM^] Other parameters are set 
same as fl2\ . Although the MAE of their method shows in 
the paper reach 4.77, we just get 7.79. This difference may 
due to the different pre-process steps. Since the BIF is the 
state-of-the-art features for age estimation, we further com- 
pare the efficiency of BIF and GLOH feature in the same 
framework with different regressors and using PC A (keeping 
98% energy) to reduce the dimensionality. Table [2] lists the 
comparative results, which shows that GLOH performs com- 
parable or even better than BIF in age estimation. Moreover, 
the MTL-based dimensionality reduction performs much bet- 
ter than PCA. 

Second, following the protocol in [8], we select 854 im- 
ages with ages from to 30 years (499 males and 355 fe- 
males) as done in 0. The performance is reported by cross- 

1 http://cbcl.mit.edu/software-datasets/standardmodel/index.html 
2 http://www.csie.ntu.edu.tw/~cjlin/libsvm/ 



Table 1. Prediction errors (in MAE) of different algorithms 
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Table 2. Comparative performance of BIF and GLOH for 
age estimation with different dimensionality reduction tools 
and regressors 



Method 


MAE 


BIF+PCA+Ridge regressor 
BIF+PCA+SVR 


8.81 
7.79 


GLOH+PCA+Ridge regressor 
GLOH+PCA+SVR 

GLOH+MTL+Ridge regressor (our method) 


8.86 
7.36 
5.45 



validation method. The whole process is repeated by leave- 
one-out mode as the same in (8). We compare the result 
of our method with the reported methods in [8]. Table 3 
summarizes the results based on the MAE. Our method per- 
forms better than others again. In addition to the MAE mea- 
sures, we also explore the cumulative score as the perfor- 
mance measure. Figure 3 illustrates the comparative perfor- 
mance in terms of the cumulative accuracy which shows that 
our method performs better than the other two methods con- 
sistently and achieves a 96% accuracy rate on the 10-year tol- 
erant error. 

5. CONCLUSIONS 

In this paper, we have proposed a novel age estimation frame- 
work based on GLOH feature and MTL. By using GLOH fea- 
ture to represent face image and using multi-task learning to 
select features, we can select a few informative feature bins 
for age estimation. Ridge regression was adopted to confirm 
the weights of the selected feature bins. With them, we ob- 
tain an age regression model, the method takes advantages of 
low-dimension, high discriminative power and favorable per- 
formance over previous approaches. 
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Table 3. The comparative performance in terms of MAE us- 
ing the protocol in (8) 
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